Overview

Dataset statistics

Number of variables35
Number of observations1370
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 MiB
Average record size in memory882.4 B

Variable types

CAT17
NUM16
BOOL2

Reproduction

Analysis started2020-08-25 05:15:05.730181
Analysis finished2020-08-25 05:15:55.476424
Duration49.75 seconds
Versionpandas-profiling v2.7.1
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
Possui carro has constant value "1" Constant
Maior de idade has constant value "1" Constant
Horas de trabalho padrão has constant value "80" Constant
Renda is highly correlated with PosicaoHigh correlation
Posicao is highly correlated with RendaHigh correlation
Cargo is highly correlated with DepartmentoHigh correlation
Departmento is highly correlated with CargoHigh correlation
Subordinado has unique values Unique
Quantidade de empresas que trabalho has 180 (13.1%) zeros Zeros
Horas de treinamento ultimo ano has 49 (3.6%) zeros Zeros
Anos na última empresa has 40 (2.9%) zeros Zeros
Anos na posição atual has 225 (16.4%) zeros Zeros
Anos desde última promoção has 536 (39.1%) zeros Zeros
Anos com a mesma gerência has 244 (17.8%) zeros Zeros

Variables

Idade
Real number (ℝ≥0)

Distinct count43
Unique (%)3.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.03065693430657
Minimum18
Maximum60
Zeros0
Zeros (%)0.0%
Memory size10.8 KiB

Quantile statistics

Minimum18
5-th percentile24
Q130
median36
Q343
95-th percentile54
Maximum60
Range42
Interquartile range (IQR)13

Descriptive statistics

Standard deviation9.19652777
Coefficient of variation (CV)0.2483490311
Kurtosis-0.4485800429
Mean37.03065693
Median Absolute Deviation (MAD)6
Skewness0.408571737
Sum50732
Variance84.57612302
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
34 73 5.3%
 
35 72 5.3%
 
31 64 4.7%
 
29 63 4.6%
 
36 62 4.5%
 
30 57 4.2%
 
33 54 3.9%
 
32 53 3.9%
 
38 53 3.9%
 
40 53 3.9%
 
Other values (33) 766 55.9%
 
ValueCountFrequency (%) 
18 8 0.6%
 
19 6 0.4%
 
20 10 0.7%
 
21 12 0.9%
 
22 15 1.1%
 
ValueCountFrequency (%) 
60 5 0.4%
 
59 10 0.7%
 
58 13 0.9%
 
57 4 0.3%
 
56 12 0.9%
 
Distinct count3
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
Misto
965
Cliente
263
Escritório
 
142
ValueCountFrequency (%) 
Misto 965 70.4%
 
Cliente 263 19.2%
 
Escritório 142 10.4%
 

Length

Max length10
Mean length5.902189781
Min length5
ValueCountFrequency (%) 
Lowercase_Letter 10 76.9%
 
Uppercase_Letter 3 23.1%
 
ValueCountFrequency (%) 
Latin 13 100.0%
 
ValueCountFrequency (%) 
ASCII 12 100.0%
 

Pontuação teste
Real number (ℝ≥0)

Distinct count853
Unique (%)62.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean807.2496350364963
Minimum102
Maximum1499
Zeros0
Zeros (%)0.0%
Memory size10.8 KiB

Quantile statistics

Minimum102
5-th percentile167.45
Q1465.25
median806
Q31168.75
95-th percentile1428.1
Maximum1499
Range1397
Interquartile range (IQR)703.5

Descriptive statistics

Standard deviation404.4006618
Coefficient of variation (CV)0.5009610959
Kurtosis-1.20670036
Mean807.249635
Median Absolute Deviation (MAD)347.5
Skewness-0.01879791784
Sum1105932
Variance163539.8953
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
691 6 0.4%
 
530 5 0.4%
 
1329 5 0.4%
 
1082 5 0.4%
 
329 5 0.4%
 
408 5 0.4%
 
829 4 0.3%
 
950 4 0.3%
 
921 4 0.3%
 
350 4 0.3%
 
Other values (843) 1323 96.6%
 
ValueCountFrequency (%) 
102 1 0.1%
 
104 1 0.1%
 
106 1 0.1%
 
107 1 0.1%
 
109 1 0.1%
 
ValueCountFrequency (%) 
1499 1 0.1%
 
1498 1 0.1%
 
1496 2 0.1%
 
1495 3 0.2%
 
1492 1 0.1%
 

Departmento
Categorical

HIGH CORRELATION
Distinct count3
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
Engenharia
896
Vendas
414
RH
 
60
ValueCountFrequency (%) 
Engenharia 896 65.4%
 
Vendas 414 30.2%
 
RH 60 4.4%
 

Length

Max length10
Mean length8.440875912
Min length2
ValueCountFrequency (%) 
Lowercase_Letter 9 69.2%
 
Uppercase_Letter 4 30.8%
 
ValueCountFrequency (%) 
Latin 13 100.0%
 
ValueCountFrequency (%) 
ASCII 13 100.0%
 

Distancia casa-trabalho
Real number (ℝ≥0)

Distinct count29
Unique (%)2.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.105109489051095
Minimum1
Maximum29
Zeros0
Zeros (%)0.0%
Memory size10.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median7
Q314
95-th percentile26
Maximum29
Range28
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.992457375
Coefficient of variation (CV)0.8777991506
Kurtosis-0.1732500948
Mean9.105109489
Median Absolute Deviation (MAD)5
Skewness0.9698216527
Sum12474
Variance63.8793749
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2 198 14.5%
 
1 190 13.9%
 
7 83 6.1%
 
10 82 6.0%
 
3 80 5.8%
 
9 79 5.8%
 
8 77 5.6%
 
4 61 4.5%
 
5 59 4.3%
 
6 53 3.9%
 
Other values (19) 408 29.8%
 
ValueCountFrequency (%) 
1 190 13.9%
 
2 198 14.5%
 
3 80 5.8%
 
4 61 4.5%
 
5 59 4.3%
 
ValueCountFrequency (%) 
29 22 1.6%
 
28 20 1.5%
 
27 11 0.8%
 
26 23 1.7%
 
25 23 1.7%
 

Educacao
Categorical

Distinct count5
Unique (%)0.4%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
Superior completo
535
Superior incompleto - cursando
367
Superior incompleto
262
Médio completo
158
Pós-gradução
 
48
ValueCountFrequency (%) 
Superior completo 535 39.1%
 
Superior incompleto - cursando 367 26.8%
 
Superior incompleto 262 19.1%
 
Médio completo 158 11.5%
 
Pós-gradução 48 3.5%
 

Length

Max length30
Mean length20.34379562
Min length12
ValueCountFrequency (%) 
Lowercase_Letter 19 79.2%
 
Uppercase_Letter 3 12.5%
 
Space_Separator 1 4.2%
 
Dash_Punctuation 1 4.2%
 
ValueCountFrequency (%) 
Latin 22 91.7%
 
Common 2 8.3%
 
ValueCountFrequency (%) 
ASCII 20 100.0%
 

Area
Categorical

Distinct count6
Unique (%)0.4%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
Ciências das natureza
563
Medicina
433
Marketing
147
Faculdade Técnica
125
Outros
 
77
ValueCountFrequency (%) 
Ciências das natureza 563 41.1%
 
Medicina 433 31.6%
 
Marketing 147 10.7%
 
Faculdade Técnica 125 9.1%
 
Outros 77 5.6%
 
Ciências humanas 25 1.8%
 

Length

Max length21
Mean length14.30437956
Min length6
ValueCountFrequency (%) 
Lowercase_Letter 19 76.0%
 
Uppercase_Letter 5 20.0%
 
Space_Separator 1 4.0%
 
ValueCountFrequency (%) 
Latin 24 96.0%
 
Common 1 4.0%
 
ValueCountFrequency (%) 
ASCII 23 100.0%
 

Possui carro
Boolean

CONSTANT
REJECTED
Distinct count1
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
1
1370
ValueCountFrequency (%) 
1 1370 100.0%
 

Subordinado
Real number (ℝ≥0)

UNIQUE
Distinct count1370
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1012.2766423357664
Minimum2
Maximum2055
Zeros0
Zeros (%)0.0%
Memory size10.8 KiB

Quantile statistics

Minimum2
5-th percentile106.45
Q1516.25
median1014.5
Q31512
95-th percentile1887.1
Maximum2055
Range2053
Interquartile range (IQR)995.75

Descriptive statistics

Standard deviation569.9466471
Coefficient of variation (CV)0.5630344742
Kurtosis-1.188167388
Mean1012.276642
Median Absolute Deviation (MAD)498.5
Skewness-0.01915818175
Sum1386819
Variance324839.1805
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2044 1 0.1%
 
684 1 0.1%
 
669 1 0.1%
 
671 1 0.1%
 
675 1 0.1%
 
677 1 0.1%
 
679 1 0.1%
 
680 1 0.1%
 
682 1 0.1%
 
683 1 0.1%
 
Other values (1360) 1360 99.3%
 
ValueCountFrequency (%) 
2 1 0.1%
 
5 1 0.1%
 
7 1 0.1%
 
8 1 0.1%
 
10 1 0.1%
 
ValueCountFrequency (%) 
2055 1 0.1%
 
2044 1 0.1%
 
2032 1 0.1%
 
2027 1 0.1%
 
2023 1 0.1%
 
Distinct count4
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
3
427
4
418
1
263
2
262
ValueCountFrequency (%) 
3 427 31.2%
 
4 418 30.5%
 
1 263 19.2%
 
2 262 19.1%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

Genero
Categorical

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
M
818
F
552
ValueCountFrequency (%) 
M 818 59.7%
 
F 552 40.3%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Uppercase_Letter 2 100.0%
 
ValueCountFrequency (%) 
Latin 2 100.0%
 
ValueCountFrequency (%) 
ASCII 2 100.0%
 

Horas voluntariado
Real number (ℝ≥0)

Distinct count71
Unique (%)5.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean65.80729927007299
Minimum30
Maximum100
Zeros0
Zeros (%)0.0%
Memory size10.8 KiB

Quantile statistics

Minimum30
5-th percentile33
Q148
median66
Q383
95-th percentile97
Maximum100
Range70
Interquartile range (IQR)35

Descriptive statistics

Standard deviation20.38990152
Coefficient of variation (CV)0.3098425516
Kurtosis-1.20287588
Mean65.80729927
Median Absolute Deviation (MAD)18
Skewness-0.0359905433
Sum90156
Variance415.748084
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
66 27 2.0%
 
96 27 2.0%
 
84 27 2.0%
 
79 25 1.8%
 
98 25 1.8%
 
54 25 1.8%
 
42 25 1.8%
 
48 25 1.8%
 
87 24 1.8%
 
45 24 1.8%
 
Other values (61) 1116 81.5%
 
ValueCountFrequency (%) 
30 18 1.3%
 
31 14 1.0%
 
32 23 1.7%
 
33 19 1.4%
 
34 12 0.9%
 
ValueCountFrequency (%) 
100 16 1.2%
 
99 19 1.4%
 
98 25 1.8%
 
97 20 1.5%
 
96 27 2.0%
 
Distinct count4
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
3
814
2
344
4
 
140
1
 
72
ValueCountFrequency (%) 
3 814 59.4%
 
2 344 25.1%
 
4 140 10.2%
 
1 72 5.3%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

Posicao
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count5
Unique (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0817518248175184
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Memory size10.8 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33
95-th percentile4
Maximum5
Range4
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.11397386
Coefficient of variation (CV)0.5351136707
Kurtosis0.3368405214
Mean2.081751825
Median Absolute Deviation (MAD)1
Skewness1.006824276
Sum2852
Variance1.240937762
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2 502 36.6%
 
1 496 36.2%
 
3 202 14.7%
 
4 104 7.6%
 
5 66 4.8%
 
ValueCountFrequency (%) 
1 496 36.2%
 
2 502 36.6%
 
3 202 14.7%
 
4 104 7.6%
 
5 66 4.8%
 
ValueCountFrequency (%) 
5 66 4.8%
 
4 104 7.6%
 
3 202 14.7%
 
2 502 36.6%
 
1 496 36.2%
 

Cargo
Categorical

HIGH CORRELATION
Distinct count9
Unique (%)0.7%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
Vendedo senior
305
Engenheiro
273
Tecnico
235
Supervisor
137
Analista
124
Other values (4)
296
ValueCountFrequency (%) 
Vendedo senior 305 22.3%
 
Engenheiro 273 19.9%
 
Tecnico 235 17.2%
 
Supervisor 137 10.0%
 
Analista 124 9.1%
 
Gerente 99 7.2%
 
Diretor 76 5.5%
 
Vendedor junior 72 5.3%
 
Assistente 49 3.6%
 

Length

Max length15
Mean length10.07445255
Min length7
ValueCountFrequency (%) 
Lowercase_Letter 17 68.0%
 
Uppercase_Letter 7 28.0%
 
Space_Separator 1 4.0%
 
ValueCountFrequency (%) 
Latin 24 96.0%
 
Common 1 4.0%
 
ValueCountFrequency (%) 
ASCII 25 100.0%
 
Distinct count4
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
4
430
3
404
1
273
2
263
ValueCountFrequency (%) 
4 430 31.4%
 
3 404 29.5%
 
1 273 19.9%
 
2 263 19.2%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

Estado civil
Categorical

Distinct count3
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
Casado
626
Solteiro
429
Divorciado
315
ValueCountFrequency (%) 
Casado 626 45.7%
 
Solteiro 429 31.3%
 
Divorciado 315 23.0%
 

Length

Max length10
Mean length7.545985401
Min length6
ValueCountFrequency (%) 
Lowercase_Letter 11 78.6%
 
Uppercase_Letter 3 21.4%
 
ValueCountFrequency (%) 
Latin 14 100.0%
 
ValueCountFrequency (%) 
ASCII 14 100.0%
 

Renda
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count1269
Unique (%)92.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6572.754744525548
Minimum1009
Maximum19999
Zeros0
Zeros (%)0.0%
Memory size10.8 KiB

Quantile statistics

Minimum1009
5-th percentile2109
Q12932.25
median4955
Q38437.5
95-th percentile17861
Maximum19999
Range18990
Interquartile range (IQR)5505.25

Descriptive statistics

Standard deviation4755.773452
Coefficient of variation (CV)0.7235586351
Kurtosis0.9002482306
Mean6572.754745
Median Absolute Deviation (MAD)2235
Skewness1.34506851
Sum9004674
Variance22617381.12
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2342 4 0.3%
 
5562 3 0.2%
 
6347 3 0.2%
 
2741 3 0.2%
 
2451 3 0.2%
 
2404 3 0.2%
 
2610 3 0.2%
 
3452 3 0.2%
 
2559 3 0.2%
 
6272 2 0.1%
 
Other values (1259) 1340 97.8%
 
ValueCountFrequency (%) 
1009 1 0.1%
 
1051 1 0.1%
 
1052 1 0.1%
 
1081 1 0.1%
 
1091 1 0.1%
 
ValueCountFrequency (%) 
19999 1 0.1%
 
19973 1 0.1%
 
19943 1 0.1%
 
19926 1 0.1%
 
19859 1 0.1%
 

Bonus de performance
Real number (ℝ≥0)

Distinct count1329
Unique (%)97.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14296.439416058394
Minimum2094
Maximum26999
Zeros0
Zeros (%)0.0%
Memory size10.8 KiB

Quantile statistics

Minimum2094
5-th percentile3382.35
Q18009.75
median14225.5
Q320456.25
95-th percentile25431.9
Maximum26999
Range24905
Interquartile range (IQR)12446.5

Descriptive statistics

Standard deviation7122.797449
Coefficient of variation (CV)0.4982217769
Kurtosis-1.21721127
Mean14296.43942
Median Absolute Deviation (MAD)6223
Skewness0.02307073826
Sum19586122
Variance50734243.49
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
4223 3 0.2%
 
9150 3 0.2%
 
21534 2 0.1%
 
10494 2 0.1%
 
15891 2 0.1%
 
13008 2 0.1%
 
16154 2 0.1%
 
6069 2 0.1%
 
19373 2 0.1%
 
2125 2 0.1%
 
Other values (1319) 1348 98.4%
 
ValueCountFrequency (%) 
2094 1 0.1%
 
2097 1 0.1%
 
2104 1 0.1%
 
2112 1 0.1%
 
2122 1 0.1%
 
ValueCountFrequency (%) 
26999 1 0.1%
 
26997 1 0.1%
 
26968 1 0.1%
 
26956 1 0.1%
 
26933 1 0.1%
 

Quantidade de empresas que trabalho
Real number (ℝ≥0)

ZEROS
Distinct count10
Unique (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.7036496350364962
Minimum0
Maximum9
Zeros180
Zeros (%)13.1%
Memory size10.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q34
95-th percentile8
Maximum9
Range9
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.499332526
Coefficient of variation (CV)0.9244291471
Kurtosis0.02093301721
Mean2.703649635
Median Absolute Deviation (MAD)1
Skewness1.027937213
Sum3704
Variance6.246663077
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 487 35.5%
 
0 180 13.1%
 
3 152 11.1%
 
2 133 9.7%
 
4 132 9.6%
 
7 67 4.9%
 
6 65 4.7%
 
5 58 4.2%
 
9 50 3.6%
 
8 46 3.4%
 
ValueCountFrequency (%) 
0 180 13.1%
 
1 487 35.5%
 
2 133 9.7%
 
3 152 11.1%
 
4 132 9.6%
 
ValueCountFrequency (%) 
9 50 3.6%
 
8 46 3.4%
 
7 67 4.9%
 
6 65 4.7%
 
5 58 4.2%
 

Maior de idade
Boolean

CONSTANT
REJECTED
Distinct count1
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
1
1370
ValueCountFrequency (%) 
1 1370 100.0%
 
Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
Não
992
Sim
378
ValueCountFrequency (%) 
Não 992 72.4%
 
Sim 378 27.6%
 

Length

Max length3
Mean length3
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 4 66.7%
 
Uppercase_Letter 2 33.3%
 
ValueCountFrequency (%) 
Latin 6 100.0%
 
ValueCountFrequency (%) 
ASCII 5 100.0%
 

Aumento de salario%
Real number (ℝ≥0)

Distinct count15
Unique (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.174452554744526
Minimum11
Maximum25
Zeros0
Zeros (%)0.0%
Memory size10.8 KiB

Quantile statistics

Minimum11
5-th percentile11
Q112
median14
Q318
95-th percentile22
Maximum25
Range14
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.629208052
Coefficient of variation (CV)0.2391656661
Kurtosis-0.2432426366
Mean15.17445255
Median Absolute Deviation (MAD)2
Skewness0.8369406193
Sum20789
Variance13.17115109
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
13 196 14.3%
 
11 194 14.2%
 
12 188 13.7%
 
14 187 13.6%
 
15 94 6.9%
 
18 84 6.1%
 
17 79 5.8%
 
16 75 5.5%
 
19 67 4.9%
 
20 53 3.9%
 
Other values (5) 153 11.2%
 
ValueCountFrequency (%) 
11 194 14.2%
 
12 188 13.7%
 
13 196 14.3%
 
14 187 13.6%
 
15 94 6.9%
 
ValueCountFrequency (%) 
25 17 1.2%
 
24 19 1.4%
 
23 25 1.8%
 
22 47 3.4%
 
21 45 3.3%
 
Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
3
1164
4
 
206
ValueCountFrequency (%) 
3 1164 85.0%
 
4 206 15.0%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 2 100.0%
 
ValueCountFrequency (%) 
Common 2 100.0%
 
ValueCountFrequency (%) 
ASCII 2 100.0%
 
Distinct count4
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
3
426
4
407
2
284
1
253
ValueCountFrequency (%) 
3 426 31.1%
 
4 407 29.7%
 
2 284 20.7%
 
1 253 18.5%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

Horas de trabalho padrão
Categorical

CONSTANT
REJECTED
Distinct count1
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
80
1370
ValueCountFrequency (%) 
80 1370 100.0%
 

Length

Max length2
Mean length2
Min length2
ValueCountFrequency (%) 
Decimal_Number 2 100.0%
 
ValueCountFrequency (%) 
Common 2 100.0%
 
ValueCountFrequency (%) 
ASCII 2 100.0%
 

Beneficios
Categorical

Distinct count4
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
0
577
1
564
2
149
3
 
80
ValueCountFrequency (%) 
0 577 42.1%
 
1 564 41.2%
 
2 149 10.9%
 
3 80 5.8%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

Anos de experiencia
Real number (ℝ≥0)

Distinct count40
Unique (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.357664233576642
Minimum0
Maximum40
Zeros10
Zeros (%)0.7%
Memory size10.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q16
median10
Q315
95-th percentile28
Maximum40
Range40
Interquartile range (IQR)9

Descriptive statistics

Standard deviation7.849234237
Coefficient of variation (CV)0.6910958165
Kurtosis0.8260698358
Mean11.35766423
Median Absolute Deviation (MAD)4
Skewness1.105015062
Sum15560
Variance61.6104781
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
10 189 13.8%
 
6 115 8.4%
 
8 93 6.8%
 
9 90 6.6%
 
5 84 6.1%
 
1 76 5.5%
 
7 76 5.5%
 
4 56 4.1%
 
12 45 3.3%
 
3 39 2.8%
 
Other values (30) 507 37.0%
 
ValueCountFrequency (%) 
0 10 0.7%
 
1 76 5.5%
 
2 28 2.0%
 
3 39 2.8%
 
4 56 4.1%
 
ValueCountFrequency (%) 
40 1 0.1%
 
38 1 0.1%
 
37 4 0.3%
 
36 6 0.4%
 
35 3 0.2%
 

Horas de treinamento ultimo ano
Real number (ℝ≥0)

ZEROS
Distinct count7
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.8007299270072994
Minimum0
Maximum6
Zeros49
Zeros (%)3.6%
Memory size10.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q33
95-th percentile5
Maximum6
Range6
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.282744749
Coefficient of variation (CV)0.4580037284
Kurtosis0.5289010334
Mean2.800729927
Median Absolute Deviation (MAD)1
Skewness0.5630166766
Sum3837
Variance1.645434091
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2 511 37.3%
 
3 459 33.5%
 
4 118 8.6%
 
5 107 7.8%
 
1 65 4.7%
 
6 61 4.5%
 
0 49 3.6%
 
ValueCountFrequency (%) 
0 49 3.6%
 
1 65 4.7%
 
2 511 37.3%
 
3 459 33.5%
 
4 118 8.6%
 
ValueCountFrequency (%) 
6 61 4.5%
 
5 107 7.8%
 
4 118 8.6%
 
3 459 33.5%
 
2 511 37.3%
 

Estilo de vida
Categorical

Distinct count4
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
3
837
2
315
4
 
145
1
 
73
ValueCountFrequency (%) 
3 837 61.1%
 
2 315 23.0%
 
4 145 10.6%
 
1 73 5.3%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

Anos na última empresa
Real number (ℝ≥0)

ZEROS
Distinct count36
Unique (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.031386861313869
Minimum0
Maximum37
Zeros40
Zeros (%)2.9%
Memory size10.8 KiB

Quantile statistics

Minimum0
5-th percentile1
Q13
median5
Q310
95-th percentile20
Maximum37
Range37
Interquartile range (IQR)7

Descriptive statistics

Standard deviation6.127906824
Coefficient of variation (CV)0.8715075625
Kurtosis3.669894901
Mean7.031386861
Median Absolute Deviation (MAD)3
Skewness1.732534164
Sum9633
Variance37.55124205
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
5 183 13.4%
 
1 159 11.6%
 
3 124 9.1%
 
2 117 8.5%
 
10 114 8.3%
 
4 97 7.1%
 
7 87 6.4%
 
8 78 5.7%
 
9 73 5.3%
 
6 68 5.0%
 
Other values (26) 270 19.7%
 
ValueCountFrequency (%) 
0 40 2.9%
 
1 159 11.6%
 
2 117 8.5%
 
3 124 9.1%
 
4 97 7.1%
 
ValueCountFrequency (%) 
37 1 0.1%
 
36 2 0.1%
 
34 1 0.1%
 
33 5 0.4%
 
32 3 0.2%
 

Anos na posição atual
Real number (ℝ≥0)

ZEROS
Distinct count19
Unique (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.244525547445256
Minimum0
Maximum18
Zeros225
Zeros (%)16.4%
Memory size10.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q12
median3
Q37
95-th percentile11
Maximum18
Range18
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.617896828
Coefficient of variation (CV)0.8523677824
Kurtosis0.4451941523
Mean4.244525547
Median Absolute Deviation (MAD)3
Skewness0.8981874968
Sum5815
Variance13.08917746
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2 347 25.3%
 
0 225 16.4%
 
7 210 15.3%
 
3 120 8.8%
 
4 94 6.9%
 
8 83 6.1%
 
9 66 4.8%
 
1 56 4.1%
 
5 36 2.6%
 
6 35 2.6%
 
Other values (9) 98 7.2%
 
ValueCountFrequency (%) 
0 225 16.4%
 
1 56 4.1%
 
2 347 25.3%
 
3 120 8.8%
 
4 94 6.9%
 
ValueCountFrequency (%) 
18 2 0.1%
 
17 4 0.3%
 
16 6 0.4%
 
15 7 0.5%
 
14 10 0.7%
 

Anos desde última promoção
Real number (ℝ≥0)

ZEROS
Distinct count16
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.2065693430656936
Minimum0
Maximum15
Zeros536
Zeros (%)39.1%
Memory size10.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q33
95-th percentile9
Maximum15
Range15
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.220930099
Coefficient of variation (CV)1.459700376
Kurtosis3.521756077
Mean2.206569343
Median Absolute Deviation (MAD)1
Skewness1.963621395
Sum3023
Variance10.37439071
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 536 39.1%
 
1 329 24.0%
 
2 153 11.2%
 
7 74 5.4%
 
4 57 4.2%
 
3 51 3.7%
 
5 41 3.0%
 
6 30 2.2%
 
11 23 1.7%
 
8 17 1.2%
 
Other values (6) 59 4.3%
 
ValueCountFrequency (%) 
0 536 39.1%
 
1 329 24.0%
 
2 153 11.2%
 
3 51 3.7%
 
4 57 4.2%
 
ValueCountFrequency (%) 
15 11 0.8%
 
14 9 0.7%
 
13 10 0.7%
 
12 9 0.7%
 
11 23 1.7%
 

Anos com a mesma gerência
Real number (ℝ≥0)

ZEROS
Distinct count18
Unique (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.141605839416059
Minimum0
Maximum17
Zeros244
Zeros (%)17.8%
Memory size10.8 KiB

Quantile statistics

Minimum0
5-th percentile0
Q12
median3
Q37
95-th percentile10
Maximum17
Range17
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.578048534
Coefficient of variation (CV)0.8639278272
Kurtosis0.1980181544
Mean4.141605839
Median Absolute Deviation (MAD)3
Skewness0.8369400445
Sum5674
Variance12.80243131
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2 323 23.6%
 
0 244 17.8%
 
7 204 14.9%
 
3 132 9.6%
 
8 101 7.4%
 
4 90 6.6%
 
1 68 5.0%
 
9 60 4.4%
 
5 29 2.1%
 
6 26 1.9%
 
Other values (8) 93 6.8%
 
ValueCountFrequency (%) 
0 244 17.8%
 
1 68 5.0%
 
2 323 23.6%
 
3 132 9.6%
 
4 90 6.6%
 
ValueCountFrequency (%) 
17 7 0.5%
 
16 2 0.1%
 
15 5 0.4%
 
14 4 0.3%
 
13 14 1.0%
 

Contratar
Categorical

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.8 KiB
Não
1178
Sim
 
192
ValueCountFrequency (%) 
Não 1178 86.0%
 
Sim 192 14.0%
 

Length

Max length3
Mean length3
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 4 66.7%
 
Uppercase_Letter 2 33.3%
 
ValueCountFrequency (%) 
Latin 6 100.0%
 
ValueCountFrequency (%) 
ASCII 5 100.0%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

IdadeLocal de trabalhoPontuação testeDepartmentoDistancia casa-trabalhoEducacaoAreaPossui carroSubordinadoSatisfação com o ambiente no emprego atualGeneroHoras voluntariadoEnvolvimento com trabalhoPosicaoCargoSatisfação com empregoEstado civilRendaBonus de performanceQuantidade de empresas que trabalhoMaior de idadeNecessita de hora extraAumento de salario%Performance na entrevistaSatisfação com a relaçãoHoras de trabalho padrãoBeneficiosAnos de experienciaHoras de treinamento ultimo anoEstilo de vidaAnos na última empresaAnos na posição atualAnos desde última promoçãoAnos com a mesma gerênciaContratar
049Cliente279Engenharia8Médio completoCiências das natureza123M6122Engenheiro2Casado51302490711Não2344801103310717Não
133Misto1392Engenharia3Superior incompleto - cursandoCiências das natureza154F5631Engenheiro3Casado29092315911Sim11338008338730Não
227Cliente591Engenharia2Médio completoMedicina171M4031Tecnico2Casado34681663291Não12348016332222Não
332Misto1005Engenharia2Superior incompletoCiências das natureza184M7931Tecnico4Solteiro30681186401Não13338008227736Não
459Misto1324Engenharia3Superior completoMedicina1103F8141Tecnico1Casado2670996441Sim204180312321000Não
530Cliente1358Engenharia24Médio completoCiências das natureza1114M6731Tecnico3Divorciado26931333511Não22428011231000Não
638Misto216Engenharia23Superior completoCiências das natureza1124M4423Supervisor3Solteiro9526878701Não214280010239718Não
736Misto1299Engenharia27Superior completoMedicina1133M9432Analista3Casado52371657761Não133280217327777Não
835Misto809Engenharia16Superior completoMedicina1141M8441Tecnico2Casado24261647901Não13338016535403Não
929Misto153Engenharia15Superior incompletoCiências das natureza1154F4922Tecnico3Solteiro41931268201Sim123480010339508Não

Last rows

IdadeLocal de trabalhoPontuação testeDepartmentoDistancia casa-trabalhoEducacaoAreaPossui carroSubordinadoSatisfação com o ambiente no emprego atualGeneroHoras voluntariadoEnvolvimento com trabalhoPosicaoCargoSatisfação com empregoEstado civilRendaBonus de performanceQuantidade de empresas que trabalhoMaior de idadeNecessita de hora extraAumento de salario%Performance na entrevistaSatisfação com a relaçãoHoras de trabalho padrãoBeneficiosAnos de experienciaHoras de treinamento ultimo anoEstilo de vidaAnos na última empresaAnos na posição atualAnos desde última promoçãoAnos com a mesma gerênciaContratar
136032Cliente238Engenharia5Superior incompletoCiências das natureza119391F4741Engenheiro3Solteiro24321531831Sim14318008234103Sim
136127Misto1337RH22Superior completoCiências humanas119441F5821Assistente2Casado28631955511Não12318001231000Sim
136228Cliente1404Engenharia17Superior completoFaculdade Técnica119603M3221Tecnico4Divorciado23671877951Não12318016224103Sim
136331Misto754Vendas26Superior incompleto - cursandoMarketing119671M6332Vendedo senior4Casado56172107511Sim1133800104310708Sim
136453Cliente1168Vendas24Superior incompleto - cursandoCiências das natureza119681M6633Vendedo senior1Solteiro10448584361Sim133280015222222Sim
136523Misto638Vendas9Superior completoMarketing120234M3331Vendedor junior1Casado17902695611Não19318011321010Sim
136629Misto1092Engenharia1Superior incompleto - cursandoMedicina120271M3631Engenheiro4Casado47872612491Sim14328034342222Sim
136756Cliente310Engenharia7Superior incompletoFaculdade Técnica120324M7231Tecnico3Casado2339366681Não1134801144110998Sim
136850Misto878Vendas1Superior incompleto - cursandoCiências das natureza120442M9432Vendedo senior3Divorciado67281425571Não123480212336301Sim
136950Cliente410Vendas28Superior completoMarketing120554M3923Vendedo senior1Divorciado108541658641Sim133280120333220Sim